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Abstract: We present a number of bounds on convergence time for two elitist population-based Evolutionary Algorithms 
using a recombination operator k-Bit-Swap and a mainstream Randomized Local Search algorithm. We study 
the effect of distribution of elite species and population size. 



1 INTRODUCTION 



1.2 Definitions and Assumptions 



The main objective of this article is to derive convergence 
properties of two elitist Evolutionary Algorithms (EAs) 
on OneMax and Royal Roads test functions. One of the 
analyzed algorithms uses the k-Bit-Swap (kBS) operator 
introduced in ( |Ter-Sarkisov et al., 2010| l. We compare 
our results to computational findings and other research. 

A population consists of a set of solution strings. 
We split them into two groups: there are elite strings, 
which have the same highest fitness value and the 
remaining non-elite strings. We use the standard notation 
for the population fi, recombination pool A,, the elite 
species a, the non-elite p. 



1.1 Past work 

Recently (1 + 1)EA with i flip probability (n being the 
length of a chromosome) became a matter of extensive 
investigation. Sharp lower and upper bounds for One- 



Max and general linear functions were found in (Doerr 
et al., 2010c| |Doerr et al., 2010al |Doerr et al., 201 1| 
Droste et al., 2002) applying drift analysis and potential 
functions. Specifically, in ( [Doerr et al., 201 l) the upper 
bound for (1 + 1)EA solving OneMax was derived to 
be (1 +o(l))1.39e«logn and in (|Doerr et al„ 2010a i 



the lower bound for the same setting was found to be 
(1 — o(l))enlogn. Drift (a form of super martingale) 



was introduced in (Hajek, 1982 He and Yao, 2003 He 
land Yao, 2004| . 



We analyze two fitness functions here, OneMax (simple 
counting 1 's test function) and Royal Roads (see Section 
Hlfor additional definitions for it). The fitness of a pop- 
ulation is defined as the fitness of an elite string. Since 
both functions have global solution at n, we are interested 
in the following time parameter: 



%A = min{f > : f(a) — n} 



(1) 



that is, the minimum time when (for algorithm A) the 
best species in the population reaches the highest fitness 
value. Since the analysis is probabilistic, we need the 
expectation of this parameter: Et^. 

We assume that we do not need a large number of 
species for evolution. Though this sounds a bit vague, 
this justifies the choice of distributions with respective 
parameters. The expectation of Poisson random variable 
used here is 1 , for Uniform it is . We use the latter 
due to its simplicity. 

We restrict our attention only to elite pairs (kBS) 
or parents (RLS), to simplify the analysis, since other- 
wise we would have to make more assumptions about 
the fitness of non-elite parents p. 



1.3 k-Bit-Swap Operator 

This genetic recombination operator (see Figure [TJ was 
introduced in (Ter-Sarkisov et al., 2010! and proved to 



work efficiently both alone and together with mainstream 
operators (crossovers and mutation). Its efficiency was 
mostly visible on functions like Rosenbrock, Ackley, 



Randomly selectthis bit Swap this bit 

Figure 1: 1 -Bit-Swap Operator 



Algorithm 1:(^ + X)EA 1BS 


1 


Initialize population size /j 




repeat until condition fulfilled: 


2a 


select A, species from the population using 




Tournament selection 


2b 


apply IBS operator to each pair in the 




recombination pool 


2c 


keep a best species in the population, 




replace the rest with the best species 




from the pool 


Algorithm 2: (jj. + X)RLS 


1 


Initialize population size /j 




repeat until condition fulfilled: 


2a 


select A, species from the population using 




Tournament selection 


2b 


flip exactly one bit per chromosome 


2c 


keep a best species in the population, 




replace the rest with the best species 




from the pool 



Table 1 : Pseudocode for the algorithms analyzed in this article 



Rastrigin and Royal Roads. We also tested its perfor- 
mance on OneMax specifically for this article. 

1.4 Our findings 

The models we derive are complete, i.e. they are func- 
tions of just population size /j, recombination pool X and 
length of the chromosome «, i.e. the actual parameters 
of EAs, though we make some weak assumptions about 
the pairing of parents. 

We derive the expectation of convergence time for 
the population-based elitist EA with a recombination 
operator (1 -Bit-Swap Operator) and mutation-based 
RLS. Our theoretical and computational findings confirm 
that for OneMax the benefit of population is unclear, i.e. 
its effect is not always positive. For Royal Roads it is 
always positive. This problem-specific issue was noticed 
before (see e.g. Figures 4 and 5 in ( |He and Yao, 2 002 )). 

We use two distributions of elite species in the 
population: Uniform(i) and Poisson(l). Since the ex- 
pressions for the expected first hitting time of algorithms 
Ex we have found are quite complicated, we do the 
computational estimation and find some asymptotic 
results as well. 



Tournament Selection Procedure We use this selec- 
tion because it is fairly straightforward in implementation 
and analysis. 

• Select two species xuXj uniformly at random 

• if f(xi) = f(xj), either xi or xj enters the pool at 
random 

• else the species with better fitness enters the pool 



2 ANALYSIS OF ALGORITHM 1 ON 
ONEMAX PROBLEM 

We start with Uniform distribution of elite species 
with parameter 1, which gives the lower bound on 

convergence time, which is due to the assumption on the 
number of elite species needed for the evolution. 

The probability of selecting an elite pair is 
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(2) 



Since we restrict the analysis only to elite pairs, the prob- 
ability of evolution (generation of a better offspring as a 
result of 1 -Bit-Swap) is 

1 /k\2 

where k = Q: § - 1, which is due to the assumption that at 
the start of the algorithm /(a) = § . 

2.1 Uniform distribution of elite species 

We are deriving an upper bound on the probabil- 
ity (and, therefore, lower bound on the expectation 
of the first hitting time). We are interested in the 
probability of evolving at least 1 new elite species 
next generation, i.e. of at least 1 successful swap. 

P( at least 1 new elite species in the population at t + 1) 
= 1 — P(no new elite species in the population at t + 1 ) 

We define Go to be the event that no new species 
evolves over 1 particular generation. The number of elite 
pairs Hj in the population varies from to |, and elite 
species a in the population from 1 to fj. In this regard, 
probability to select a number of pairs given a elite 
species in the population is P se ia- By the Law of total 
probability, 

x 

P(G ) = f i P(G \H J )P(H J ) 

j=o 
x 

= £p(G Q \H ] )f t P(H J \a)P(a) (4) 

;=0 <x=l 



We assume Uniform probability of each number of elite 
species in the population: P(a) = «. 

P(G \H )P(H ) = P(G \H ) £ P(H \a)P(a) 
a=l 

= £ P{H \a)P(a) 



selfj J 



(5) 



to get 1 elite pair: 



and 



P(G \Hi )P(H l ) = ( 1 + 2 ( £ ) 2 ) i £ />(#, | a) 



therefore, the probability of failure given 1 elite pair 
given k improvements so far is 











a- 




(i 











(6) 

For cases {Hj:2<j< |}thelo gic is similar, so the full 
expression for the probability of failure is 

x 

2 

P(G Bk ) = '£ i P(Go\H J )P(Hj) 

-;£(H9T(! 

ft / a 2 (a + 2fri-a)) 2 yY, cc 2 (a + 2(^- a)) 2 \ - 

Interchanging the sums and using the standard binomial 
expansion (s + t) n = ££ =0 

r/ a 2 («+2Cu-q)) 2 \ 

The probability of evolution (obtaining a better species) 
is therefore 

P(G k ) = \-P(G ok ) 

I id /l /«:\ 2 



:ach 0<A:<2 — lwe have 
1 

1- jasrESUi {f 4 - (s -2(^) 2 )(a(a + 2( jU -a))) 2 } 1 



„2X+1 



/' a+1 -E'a =1 {^ 4 -(i-2(^) 2 )((x(a + 2( A( -a))) 2 } 1 



and therefore the expected first hitting time for the algo- 
rithm is 



Ex 



{jj+X)EA ws = E E7 * 
*=0 



% a+ '-E'« =1 {f 4 -(i-2(^) )(a(a + 2(^-a))) 2 } : 



(8) 



Despite having two sums without closed forms, the 
convergence rate of this algorithms depends only on 
the size of the population, recombination pool and the 
length of the string, that is, the real-life parameters of 
EA. Therefore, the model is complete. 



3 ANALYSIS OF ALGORITHM 2 
SOLVING ONEMAX 

For comparison, we derive Et for {p + X)EArls using 
similar approach (Law of total probability + sum of Geo- 
metric RVs). Changes apply mostly to the selection prob- 
ability, as we have no pairs to form: 



(9) 



3.1 Uniform distribution of elite species 

We use the same assumptions of uniform distribution of 
elite species in the population as with the (ji + X)EA\gs- 

Failure event Go is defined in the same way: no 
successful flips in the recombination pool, so the prob- 
ability thereof is defined in a similar way to the one in 
Equation [4] 

P(G ) = £ P(G \Hj) £ P(Hj\a)P(a) (10) 

j"=0 a=l 

Only in this case j is the number of elite parents in the 
pool and goes from to A,. 

P(Hj\a)P(a)=r\ P i el (l- PseI ) x -j 

and therefore (using the same idea with the binomial ex- 
pansion) 

nG 0k )= l -i(ly( x )ipui-p^- j 

" j=0 WI/ \J/ a=l 

i * * a\ tk 



£^-a(^-a)(l-^ 



V " a=l 



(ID 



Unfortunately, the closed expression for this sum exists 
only for specific values of X, so we have to keep it this 
way and later obtain the results computationally. 

P(G k ) = 1 - P(G ak ) = 1 — -L. - afti - <x)(l - - 1 

r K+J a =i n 

So the expected optimization time of the algorithm is 



Ex.. 



(p+\)RLS : 



n—\ n—\ i 



(12) 



As the case is with + A.)EA[gs, this is a somewhat op- 
timistic estimate, since it assigns fairly high probabili- 
ties to high proportions of elite species in the population. 
This is confirmed by numerical estimates. 



4 ANALYSIS OF ALGORITHM 1 ON 
ROYAL ROADS FUNCTION 

We use the setu p for RR problem along the lines of 
( |Mitchell, 1996)> (referred to as #1 in the book).The 
chromosome of length n is split into K blocks, each of 
length M. The fitness of each block is if there are any 
Os in the block, and M if all of the bits in it have value 1 . 
The fitness of the chromosome is the sum of the value 
of the blocks, so it can take values 0,K,2K, . . .MK. We 
index the blocks using index K. Originally this problem 
was designed to test EA's capacity for recombining 
buil ding blocks com pared to other heuristics (for details 
see ( [Mitchell, 19%) ). 

Additionally, we introduce an auxiliary function 
used to measure progress between improveme nts in 
the fitness (the idea is similar to that in, e.g. ( |Doerr| 
|et al., 201%) JHe and Yao, 20 04)), which in our case 
is V K = OneMax(s K ) since both functions achieve the 
global optimum at = M and max/(s) — V s — £ V K = n. 



There is an important difference from the standard 
OneMax problem: unlike it, when parents exchange 
genetic information, it doesn't matter where the infor- 
mation comes from (which segment of the parent), but 
it matters where it is inserted, because it may mean that 
the fitness of the recipient segment has reached M, and 
therefore the fitness of the whole parent increased by the 
same value. 

The second important observation is that of all seg- 
ments in the chromosome there is one, which evolves 
first, denote it Ki (this is only possible due to the 
parameters of the EA in discussion). This means that 
segments evolve in a sequence: Ki,K2, . . . ,K&. 

We pessimistically assume that the best auxiliary 
function value in the first generation is | and fitness 
function is 0. We also assume that the starting value in 
each bin K is y . In the same way as with OneMax, we 
make assumptions about the distribution of elite species 



in the population, rather than their exact or approximate 
number. 

We start with introducing the probability of failure: 

X 

P(G ) = £ P(G \Hj) £ P(Hj\a)P(at) 
j=0 a=l 

where all variables are the same as in (ji + X)EAibs solv- 
ing OneMax: Hj is j'th elite pair in the recombination 
pool X, a is the number of elite species in the population 
fj with both highest fitness and auxiliary function values. 
The selection function is the same as Equation[2j 

a 2 (a + 2(^-a)) 2 



P S el(a) : 



;< 4 



The successful event is defined as evolution of at least 
one more elite species in the population. The number of 
bits equal to left to swap/flip in a segment we use /. 
So now the probability of successful swap in Equation [2] 
becomes 

_ 2 (f ~ l ) (f + i r + Z ) _ (M-2l)(n + kM + 21) 



•swap- n2 - 2n2 (13) 

The auxiliary function for each bin K, V K lies between 
and y — 1 and k between and K — 1 , where K is the 
total number of bins K to fill. The probability of failure is 

(M-2l)(n + kM + 2l) 



Pf — 1 Pswan — 1 



2n 2 



_ 2n 2 ~{M-2l)(n + kM + 2l) 
2rfi 

The probability to fail to improve a bit in a bin given I 
improvements so far is 

pf Gol ) = ly f 2n 2 -(M-2l)(n + kM + 2l) y (\ 
£ ^ (cc(ct + 2^-oc))) y^ {a{a + 2y(n-a))) ^-j 



I 

a=l 



1 



2-1 



^"0 \J . . 

1 V 

= ~'£{l-P,d{a)P„ >ap ) 

P a=l 

Therefore, 



(14) 



P(G,) = l-P(Goi) = 1 - - V {\-P se i{a)P„ ap )i 
PSA 

Expected time until improving the auxiliary function of 
a bin K is 

M , 



Er K = X 



1 



(15) 



to P (G,) 

and, finally,summing over all k from to K — 1 we obtain 
(since G depends on both I and k) 

r-if-i 

Ex/, 



1 



% ((i+X)eAibs ~ ll pir.,, \ 

k=0 1=0 r \ u '.k) 



1 



E E 

to to ^+'-^ =1 (/-(a(a + 2( M -a)))2p m . ap )t 



(16) 



<" 


c(M) 




24e 


4 


41 


10 


w 1.58198 


20 


w 1.58198 


30 


« 1.58198 



Table 2: Values of the normalizing constant, Equation 
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5 ANALYSIS OF ALGORITHM 2 ON 
ROYAL ROADS FUNCTION 



hitting time for the algorithm on RR: 



Ex. 



E E 



1 



-1 



=*V* E 



i 



i=0 „„2». 



(19) 



Just as is the case with OneMax, we present the re- 
sults for population-based RLS on Royal Roads. This 
model is a bit simpler since we do not have to pair the 
parents, and the selection is just 



P{a) se i = 



2ap a(2^-a) 



Instead of Uniform distribution of elite species in the 
population, we try Poisson distribution with parameter 1, 
and normalizing constant 



e _ el>+l) 

2Ua ~ «r^+i,i)-r(>i+i) 



(17) 



since ELof = T^Fir where r(re+l,X,) is incomplete 
Gamma function. The sizes of populations used in the 
computational experiments, the values of the normaliz- 
ing constant are set in Table [2] 



The flip probability is just 



M-21 



flip : 



(18) 



Probability of failure given / successful flips so far is 



P(Goi) = ■ 



x / 

E 1 

j=a 



M-2l\i [X 
J 



2n 



a! 



A /a(2//-a)y'/ a(2/j— a)^-J 1 



c(ji) £ lr a(2/j-a) {M-21\t 



a=l 
£M " 
W 2X a= 



2;; 

M-21\i^ 



2/j 



Therefore, the probability of success is 

P(G,) = 1-P(G ,) 
and the expected time to fill the first bin Ki is therefore 



ET Kl - 



1=0 !_ 



M . 

«P E - 



cWii a -«(2/i-a) 



M-2f 
2n 



Since we have such bins and the probability of success- 
ful sampling does not depend on the number of 1 's in the 
parent (unlike (p+XjEAigs), we obtain the expected first 



6 COMPUTATIONAL RESULTS 

Since the expressions derived in this article do not have a closed 
form, we find them computationally. To test our results, we run each 
algorithm with parameter set (p. X,n) with /j = X almost always for 50 
independent runs, each run was 2000 generations long. The average of 
optimization time is denoted T. Probability distribution used for each 
model follow standard notation in Probability theory: R for Uniform 
and P for Poisson. 

In general, results for Algorithm 1 tend to be better than for Al- 
gorithm 2 and for OneMax sharper than for RR. As we mentioned 
already, this is due to different patterns of dynamics of elite species and 
has to be investigated further. Apparently for both algorithms solving 
RR both Uniform(i) and Poisson(l) distributions give a fairly rough 
approximation that we can improve both statically (using other parame- 
ters) and dynamically (modeling change in the number of elite species). 

The other important result is that the increase of population size 
for both algorithms solving OneMax problem does not necessarily 
result in the improvement in performance, which we showed both 
theoretically and numerically. For RR the situation is much more clear: 
increase in the population always brings about the improvement in 
performance. Both models confirm this quite consistently. 



7 CONCLUSIONS AND FUTURE 
WORK 

We have derived expected running time for EAs based on two differ- 
ent genetic operators on two relatively simple fitness functions. In the 
future there are two extensions that we particularly plan to focus on: 

Approximate results for Equations |8j (T2J [TSJ [19] This 

is the most obvious of developments. Although these equations give 
good estimates for optimization time, and we have found some asymp- 
totic lower bounds, it is desirable to find sharper bounds in the form 
0(g(iA, A, n)). A big problem here are the complicated expressions in- 
volving sums. 

Evolution Of elite Species. This area has seen little focus in 
EA community, and we are keen to develop a dynamic model for 
evolution of species. In this article the model is static, i.e. distribution 
of elite species in the population is fixed (Uniform or Poisson). As 
a result, some bounds, especially for RR, seem to be quite loose. If 
instead of assuming a probability distribution of elite species with fixed 
parameters we study the convergence of the distribution, we can derive 
sharper bounds on optimization time. 



n 


A* 


I 


Ft* 

Algorithml 


"Algorithm! 


Ft* 

Algorithm! 


^Algorithm! 




1 


2 


112.9801 


113.14 


190.7979 


192.12 




2 


2 


144.6145 


218.12 


116.2812 


184.6 


50 


4 


4 


94.7621 


145.4 


73.124 


193.54 




8 


8 


62.5691 


121.86 


48.596 


184.52 




10 


10 


55.4784 


116.98 


43.488 


197.92 




1 


2 


259.8688 


265.84 


449.9205 


418.8 




2 


2 


332.6321 


455.9048 


271.56 


393.39 


100 


4 


4 


215.2445 


314.1 


168.03 


410.39 




8 


8 


139.2885 


267.72 


108.826 


420.416 




10 


10 


122.4884 


266.9 


96.3978 


405.22 




1 


2 


3743.2354 


3682.6 


6792.8 


6715 




2 


2 


4791.3413 


7072.7 


4025.9876 


6630 


1000 


4 


4 


3021.5468 


4574.3 


2413.0033 


6990 




8 


8 


1872.3595 


3866.5 


1481.761 


7016 




10 


10 


1616.4433 


3807 


1283.5502 


6834 



Table 3: Theoretical and computational bounds for OneMax test function with the assumption of Uniform distributions of elite species 
in the population (Equations[8]and 
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<" 
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Ft* 

' " ^Algorithm! 


t Algorithml 


Ft p 

Ij ^Algorithm! 


^ Algorithm! 








4 


4 


145 


315.3077 


64.8084 


672.25 


32 






10 


10 


72.4 


268.2195 


58.124 


504.625 


4 


8 


20 


20 


44.2 


192.2917 


56.175 


334.125 








30 


30 


34.5 


173.5625 


55.1 


221 








4 


4 


570.625 


612.46 


249.959 




64 






10 


10 


279.88 


497.93 


222.565 


820.6667 


8 


8 


20 


20 


153.46 


454.4681 


212.452 


715.92 








30 


30 


112.297 


372.04 


209.373 


663.6744 








4 


4 


2264.36 


1365 


1021 




128 


16 




10 


10 


1048 


1239 


940.999 




8 


20 


20 


570.44 


1091.5 


887.396 


1612 








30 


30 


401.99 


949.4 


871.1131 


1505 



Table 4: Theoretical and computational bounds for Royal Roads test function with the assumptions of Uniform and Poisson distribu- 
tions of elite species in the population (Equations 16 and 19 1. 
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